Performance improvement in Resampling Based Clustering
نویسندگان
چکیده
منابع مشابه
Resampling-based selective clustering ensembles
Traditional clustering ensembles methods combine all obtained clustering results at hand. However, we observe that it can often achieve a better clustering solution if only part of all available clustering results are combined. This paper proposes a novel clustering ensembles method, termed as resampling-based selective clustering ensembles method. The proposed selective clustering ensembles me...
متن کاملData Resampling for Path Based Clustering
Path Based Clustering assigns two objects to the same cluster if they are connected by a path with high similarity between adjacent objects on the path. In this paper, we propose a fast agglomerative algorithm to minimize the Path Based Clustering cost function. To enhance the reliability of the clustering results a stochastic resampling method is used to generate candidate solutions which are ...
متن کاملResampling for Fuzzy Clustering
Resampling methods are among the best approaches to determine the number of clusters in prototype-based clustering. The core idea is that with the right choice for the number of clusters basically the same cluster structures should be obtained from subsamples of the given data set, while a wrong choice should produce considerably varying cluster structures. In this paper I give a brief overview...
متن کاملPerformance Improvement for Frequent Term-based Text Clustering Algorithm
Frequent term-based text clustering [2] is a recently introduced text clustering technique, which uses frequent term sets and dramatically decreases the dimensionality of the document vector space, thus especially addressing itself to the problems of text clustering: very high dimensionality of the date and very large size of the databases [2]. Moreover, frequent term sets provide understandabl...
متن کاملReliability and Availability Improvement in Economic Data Grid Environment Based On Clustering Approach
Abstract - One of the important problems in grid environments is data replication in grid sites. Reliability and availability of data replication in some cases is considered low. To separate sites with high reliability and high availability of sites with low availability and low reliability, clustering can be used. In this study, the data grid dynamically evaluate and predict the condition of t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: MATICS
سال: 2020
ISSN: 2477-2550,1978-161X
DOI: 10.18860/mat.v12i1.8918